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/In  OvtrvUw  cf  OWL 


Abstract 

We  describe  the  motivation  and  overall  organization  of  the  OWL  language  for 
knowledge  representation.  OWL  consists  of  a memory  of<  concepts  in  terms  of  which  all 
English  phrases  and  all  knowledge  of  an  application  domain  are  represented,  a theory  of 
English  grammar  which  tells  how  to  map  English  phrases  into  concepts,  a parser  to 
perform  that  mapping  for  individual  sentences,  and  an  interpreter  to  carry  out  procedures 
which  are  written  in  the  same  represenutlonal  formalism.  The  system  has  been  applied  to 
the  study  of  interactive  dialogs,  explanations  of  its  own  reasoning,  and  question  answering. 

Keywords: 

Artificial  Intelligence,  Knowledge  Representation,  LMS,  Memory  Structures, 
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An  Overview  of  OWL 


I 


I.  Overview  and  Motivation 


We  have  undertaken  the  design  and  implementation  of  a new  computer  language 
for  knowledge  representation,  called  OWL.  We  have  become  convinced  that  recent  progreu 
in  Linguistics  and  in  Artificial  Intelligence  (AI)  now  suggests  a set  of  principles  which  are 
worth  implementing  as  part  of  a programming  language  to  make  them  uniformly  accessible 
for  our  further  work. 

For  a computer  program-as  for  a person-K  is  more  effective  to  know  how  to  do 
something  than  to  be  able  to  figure  it  out.  The  AI  field  has  made  important  progress  under 
an  opposite  set  of  assumptions:  that  all  knowledge  of  the  domain  should  be  expressed  in 
propositional  form  and  that  a program’s  actions  should  be  directed  by  a general-purpose 
problem  solver  operating  on  propositions  representing  the  application  world.  Such  a 
problem  solver  would  always  figure  out  what  to  do  next  based  on  the  state  of  the  world 
and  its  set  of  general  principles.  At  the  same  time,  most  programs  which  have  been  used 
for  their  ability  to  perform  In  an  application  domain  rather  than  for  their  pedagogic  clarity 
have  used  a very  different  form  of  organization;  the  knowledge  of  how  to  perform  the 
task  was  implicitly  built  into  the  steps  of  the  program.  Of  course,  such  an  organization  is 
generally  accompanied  by  Inflexibility,  difficulty  of  extension.  Incomprehensibility  and 
unprovability  of  the  program,  and  many  other  Ills.  If,  however,  we  could  express  the 
description  of  the  procedural  knowledge  of  the  program  in  the  same  formalism  as  Its 
declarative  knowledge  of  the  domain  of  application,  then  both  would  be  equally  accessible. 
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This  is  precisely  what  is  done  in  OWL-*the  program  is  Just  another  aspect  of  the 
description  of  the  application  world,  and  knowledge  of  how  to  solve  specific  problems  of 
that  world  can  be  explicitly  embedded  in  the  description. 


We  have  taken  English  as  the  basis  for  our  knowledge  representation  formalism. 
The  greatest  attraction  of  this  approach  is  that  it  almost  trivially  satisfies  our  need  for 
expressive  power.  After  all,  native  speakers  of  English  can  usually  communicate  their 
knowledge  of  any  oomain  of  interest  in  English,^  perhaps  augmented  by  specialized 
notations  and  vocabularies  particular  to  the  domain.  Because  we  choose  a computer 
representation  which  is  designed  to  be  similar  to  the  natural  language  employed  by  a 
computer -naive  user  of  one  of  our  programs,  we  expect  that  the  translation  process  from 
English  sentences  to  our  internal  structures  will  be  straightforward.  Once  we  succeed  In 
translating  the  English  phrase  into  our  internal  representation,  that  will  allow  all  of  OWL's 
activities,  including  understanding  the  sentence  in  semantic  detail,  resolving  references, 
mapping  the  sentence  onto  some  capability  of  the  system  for  acquiring  new  knowledge  or 
answering  on  the  basis  of  old,  etc.,  to  make  use  of  the  same  representational  formalism. 
This,  in  turn,  will  help  us  to  make  the  complete  operation  of  the  program  accessible  for 
explanation  to,  and  modification  by,  someone  who  may  well  understand  the  domain  of 
application  but  not  our  computer  technology. 


^ We  limit  our«r!»e«  to  *left-hemitphere  knowledfe,*  which  doe*  not  include  visual  ckill*  or 
manipulative  skill*  where  local  muscle/nerve  training  is  an  escential  component.  Tfin*,  our  domains 
are  restricted  to  reasoning  tasks  where  the  necessary  data  about  a problem  can  be  acquired 
verbally;  c.g.,  mcdiral  diagnosis  and  treatment  of  the  type  which  could  be  done  by  consultation  over 
the  telephone  (probably  not,  for  eiample,  diagnosis  of  skin  disease,  where  visual  inspection  is  a 
critical  skill),  automatic  program  w-iting,  question  answering. 
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Arguments  for  English  as  a programming  language  have  been  made  since  the 
earl7  1960’s,  yet  it  has  not  been  universally  acclaimed  as  desirable.  The  principal  objections 
to  basing  a programming  language  on  English  (or  any  natural  language)  center  on  the 
Innate  ambiguity  of  natural  language  and  its  lack  of  conciseness  when  contrasted  with 
special  mathematical  notations.  The  second  problem  is  rapidly  resolved  if  we  extend  our 
definition  of  natural  language  to  allow  the  incorporation  of  new  notations.  After  all,  the 
natural  language  of  a physics  text  is  hardly  the  literary  English  of  the  day.  The  first 
problem  has  both  a trivial  and  a difficult  component;  pure  syntactic  ambiguity,  as  created 
by  the  existence  of  homonyms  for  instance,  is  simply  controllable,  whereas  ambiguity  arising 
from  the  fact  that  what  one  (literally)  says  is  not  what  one  actually  means  is,  of  course, 
difficult  Our  response  is  simply  that  we  wish  to  begin  by  representing  precisely  what  one 
says,  and  we  will  allow  the  determination  of  the  meaning  of  each  utterance  to  be  part  of  the 
problem  that  the  system  is  to  solve. 

During  the  past  few  years,  we  have  implemented  the  following  componenu  of  a 
complete  system  based  generally  on  the  above  ideas; 

A Linguistic  Memory  System  (LMS)  [Hawkinson  19751  which  is  a memory  (data 
base)  of  concepts  in  which  all  knowledge  In  OWL  resides.  LMS  can  be 
viewed  as  a semantic  network,  with  a somewhat  unusual  interpretation  of  Its 
nodes  and  arcs. 

A theory  of  English  grammar  which  specifies  how  any  utterance  of  English  can 
be  represented  In  terms  of  LMS  concepts. 

A skeletal  world  model,  organized  as  a taxonomy  of  concepts,  and  Intimately 
related  to  the  theory  of  English  grammar. 

An  augmented  transition  network  parser  to  translate  English  utterances  into  their 
OWL  representations. 

A generator  to  perform  the  inverse  transformation  to  the  parser. 

An  Interpreter  which  carries  out  procedures  represented  In  the  OWL  formalism. 

An  explainer  which  provides  English  explanations  (via  the  generator)  of 
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procedures  and  data  dependencies  known  to  the  interpreter,  as  well  as  results 
of  previous  executions  of  those  procedures. 

These  components  are  at  differing  stages  of  development.  We  are  pursuing  a breadth-first 
approach  to  Implementation,  where  we  try  to  have  some  version  of  each  of  these 
components  before  trying  to  have  the  "ultimately"  correct  version  of  any  of  them. 

In  terms  of  the  above  componenu.  we  have  been  building  the  following  ! 

programs: 

Programwriter.  which  takes  a declarative  specification  of  simple  programs  which 
need  to  be  written  and  designs,  optimizes,  and  codes  them.  The  scope  of  its 
capabilities  includes  programs  to  maintain  bank  balances  and  sell  tickets  for 
scheduled  events  [Long  19T71 

Susie  Software,  which  Is  another  automatic  programmer,  for  writing  manipulation 
programs  for  the  blocks  world.  It  is  a research  environment  for  developing 
a discourse  model  which  lets  Susie  engage  the  user  In  a dialog  concerning 
the  program  it  is  trying  to  write  [Brown  19771 

Proctor,  a program  which  helps  a business  manager  to  design  a procurensent 
system.  It  Is  an  "unstructured"  questionnaire  which  provides  a framework 
for  a manager  to  think  about  his  system  requirements  [Bosyj  19761 

A Digitalis  Therapy  Advisor,  which  makes  clinical  judgments  about  the  condition 
of  a patient  who  is  receiving  the  drug  digitalis,  makes  further  therapeutic 
recommendations,  and  can  Interactively  explain  iu  reasoning  steps  to  the  user 
[Swartout  19771 

A question  answering  system  for  a relatively  simple  data  base. 

We  will  give  an  overview  of  LMS,  the  theory  of  grammar,  and  the  interpreter, 
and  discuss  other  modules  as  they  relate  to  those  central  components. 
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II.  The  Linsulstic  Memory  System 


The  OWL  LMS  is  a semantic  network  with  a single  primary  data  type,  the 
concopt,  and  a secondary  data  type,  the  sjmM.  Symbols  are  merely  strings  of  characters 
which  denote  senses  of  English  words  and  affixes  and  have  no  innate  significance. 
Concepts  represent  the  nteanings  of  all  words,  phrases,  clauses,  sentences,  etc  of  English  as 
well  as  any  needed  non-llnguistic  entitles.  It  is  very  important  to  note  that,  whereas  in  a 
traditional  semantic  network  each  node  of  the  network  represents  a single  word  or  item.  In 
LMS  each  node  represents  any  of  the  higher-level  constructions  mentioned  above.  Thus, 
where  a typical  semantic  net  would  identify  the  meaning  of  a sentence  as  some  subnet  of 
the  whole  network,  LMS  identifies  it  as  a single  node  of  the  network. 


ILA  The  Eetential  Structure  of  Concept  $ 


Concepts,  the  nodes  of  LMS,  havt  structurt.  In  fact,  we  will  concentrate  on  the 
essential  structure  of  a concept  as  the  primary  organizational  facility  of  LMS. 

Every  concept  is  defined  by  a pair,  (genus  specialiier),  the  essence  of  that  concept. 
The  genus  is  another  concept,  and  the  specializer  is  either  a concept  or  a symbol.  The 
g^us  specifies  the  general  type  of  the  concept;  if  the  genus  of  concept  C is  B (I.e.,  If 
C ■ (B  specUUlter)),  then  we  imply  that  C Is-a  B,  or  C is  a kind  of  B.*  C is  called  a 


* The  general  implication  of  (1-<S  or  U a hind  of  {AKO)  linka  it  that  "aomething"  (properties, 
featnree,  place  of  claaaification,  waya  to  treat,  etc.)  ia  Inherited  by  C from  B.  We  will  define  ihia 
more  preciaely  later. 
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spectalizatlon  of  B,  and  6 Is  called  a gtntrallzatlon  of  C.  The  ipeclalizer  serves  to 
distinguish  this  concept  from  all  other  concepts  with  the  same  genus;  it  does  not  by  itself 
define  the  concept*  The  genus  and  the  speclalizer  together  identif y a concept 
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We  want  ta  interpret  all  the  concepts  in  LMS  as  forming  a single  uxonomy  or 
tree-like  classification  system  in  which  the  genus  points  “up”  in  the  taxonomy.  To  do  so.  we 
must  designate  a single  concept,  SUhHUH-GENUS,  whose  genus  is  itself.  That  condition  makes 
SUtttUM-GENUS  the  root  of  the  tree.  Further,  we  Insist  that  no  loops  may  occur  in  the 
expression  of  concepts  in  terms  of  themselves  or  each  other  (with  the  above  exception  for 
SUMMUM-GENUS).  Then,  all  concepts  will  form  a tree  structured  classification;  starting  from 


( 


any  concept  in  the  conceptual  memory  and  successively  moving  to  its  genus  will  always  lead 
to  the  root  concept  SUWWM-GENtJS  in  a finite  number  of  steps.  That  number  will  be  called 
the  genus  depth  of  the  concept.  We  also  introduce  a notational  convenience.  So  far,  we 
have  only  allowed  a concept  to  be  written  as  (genus  speciallzer).  But  clearly,  the  depth  of 
parenthesization  for  writing  any  concept  will  be  at  least  Its  genus  depth,  and  this  Is  terribly 
inconvenient.  Thus,  we  allow  equivalence  declarations,  such  as  A * (B  C),  which  allows  any 


I 


i 


appearance  of  A to  sund  for  an  appearance  of  (B  C).  A is  called  the  label  of  (B  C).  \ 

] 

The  notion  of  derivative  subclassi/lcation  [Hawkinson  1975]  complicates  this  j 

picture  somewhat.  It  assures  that  all  specializations  of  a concept  are  classified  the  $an>e  way  \ 

i 

the  specializers  themselves  are  classified  In  the  conceptual  memory.  For  example,  if  In  the  j 

t 

i 

>i 

I 


* For  eiample,  we  may  represent  "dog  house*  as  (HOUSE  DOG)  and  "dof  Uii“  as  (TAIL  DOG),  and 
although  both  concepu  are  specialised  by  DOG,  they  are  clearly  different. 
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taxonomy  both  DOG  and  PIG  have  g;emis  ANIHAL,  then  we  classify  (TAIL  DOG)  and  (TAIL  PIG) 
under  (TAIL  ANIHAL).  The  gtntrallur  of  a concept  (A  ft)  Is  the  most  specific  specialization 
of  A whose  spedallzer  is  a generalization  of  ft,  or,  if  there  are  none  of  these.  Just  A itself.* 
The  genus  of  a concept  is  thus  always  either  its  generalizer  or  the  generalizer  of  lu 
gcneraUzer,  etc.  By  moving  along  the  successive  generalizers  from  any  concept,  we  must 
finally  reach  SUMHUN-GENUS,  and  the  number  of  steps  required  is  called  the  gtntrallur  dtptk 
of  the  concept 

We  have  now  deKrIbed  some  of  the  essential  structure  of  each  concept  thus  each 
node,  of  a conceptual  memory.  Before  we  turn  to  arguing  for  the  utility  of  this  structure  to 
represent  knowledge,  let  us  see  what  the  essential  structure  of  the  nodes  already  implies  for 
the  semantic  network  u a whole.  In  our  current  implementation,  every  concept  is  directly 
linked  to  its  generalizer  and  specializer.  Every  concept  it  not,  however,  linked  directly  to  its 
genus,  since  the  genus  can  easily  be  computed  from  generalizer  and  specializer  links.  A 
typical,  but  very  small,  conceptual  memory  taxonomy  is  shown  in  Figure  1. 

H.B  Attachment 

In  the  previous  section,  we  presented  the  essential  structure  of  a concept  in  LMS. 
The  act  of  creating  a new  node  in  LMS  is  called  speclalizattm,  and  we  say  that  we 


* An  intermediate  concept  in  the  taionomy,  inch  as  (TAIL  ANIHAL)  in  onr  eiample,  is 
antomatically  created  by  LMS  whenever  more  than  one  concept  may  be  classified  nnder  it.  Thus, 
the  feneraliser  of  a concept,  and  hence  the  nnmber  of  times  that  wo  need  to  move  from  a concept 
to  its  (eneraliser  in  order  to  reach  its  genns,  will  depend  dynsmically  on  what  other  cencepta  ato  in 
the  taxonomy. 
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Figure  !.  A Sample  (Conceptual  Memory  Taxonomy 

TTiit  fifar«  ihowt  the  claeejficetion  of  tome  of  the  concepU  used  in  thit  paper  into  a email 
conceptual  memory  Uionomy.  The  uionomy  it  a tree  which  it  ihown  in  the  figure  hy  auccectiTely 
indenting  branchee,  at  in  an  outline.  Note  that  deriratiye  anhclauification  eautee  the  anhtree 
under  TAIL  to  be  organised  in  a timilar  way  to  the  subtree  under  COUNT-NOUN.  Thit  sample  ia  of 
course  very  small  and  sparse;  the  taionomy  we  currently  use  has  nearly  three  thousand  concapta 
and  a correspondingly  more  complei  organisation. 
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spedaiize  a genus,  G,  bf  a speclalizer.  S.  to  form  the  concept  (6  S).  As  we  shall  argue,  any 
phrase  of  English  can  be  suitably  encoded  u a single  concept  (though  of  course  it  may  be  a 
very  complex  one).  When  we  wish  to  roasm  with  a concept,  however,  we  will  find  it 
convenient  to  introduce  an  epistemologically  distinct  meta-ltvtl  representation.  For  example, 
if  the  concept  C encodes  the  sentence  "John  Smith  is  a good  man"  and  we  wish  to  represent 
our  belief  that  C is  true,  we  cannot  merely  encode  with  0 that  "That  John  Smith  is  a good 
man  is  true,"  because  now  the  question  of  0’s  truth  is  open  for  discussion.’  We  retreat  to  a 
formal  meta-level  to  make  statements  about  elemenu  of  our  universe  of  discourse  which  are 
to  be  taken  at  face  value  rather  than  be  subject  to  interpreution.  With  such  an  ability  for 
meta-level  description,  we  see  that  if  C is  marked  as  TRUE  at  the  meta-level,  then  that  is  a 
stronger  statement  than  D.  From  the  former,  the  Interpreter  may  cotKiude  Cs  truth 
absolutely,  while  from  the  latter,  only  conditionally  on  0’s  truth. 

The  act  of  attacknunt  creates  a directed  Unk  in  LMS  between  two  nodes.  We 
write  [A  0]  and  say  that  B is  attached  to  A Unlike  specialiutlon,  attachment  creates  no  new 
concepts.  It  merely  establishes  an  (unlabelled)  link  from  A to  B.  The  meaning  of  that 
connection  will  depend  completely  on  what  A and  B are  and  on  whatever  is  interpreting  the 
connection.  We  give  a few  illustrative  examples  of  attachment  here 

All  concepts  B whose  generalizers  are  the  concept  A are  automatically  attached  to  A 

and  are  called  its  tndtxtd  branchts  because  they  are  classified  directly  under 

A in  the  specialization  taxonomy. 

’ It  is  not  nwrely  the  rapresontatien  of  truth  that  it  at  ittue  here.  A timilar  treatment  it 
neceaaary  for  auppoeition,  hypotheaia,  "poatible  futures,"  an4  in  fact  all  the  fundamental  knowledce 
on  the  baais  of  which  OWL  operates.  Of  course  the  effect  of  the  meta-lerel  statements  that  we 
allow  could  alternatively  be  introduced  by  suitable  conventions  for  the  Interpreter.  For  eiample, 
we  could  adopt  the  convention  that  any  statement  about  which  no  foalifying  information  is  known 
is  true.  We  prefer,  however,  to  make  such  a convention  part  of  the  Interpreter  and  not  part  of  the 
semantics  of  LMS. 


K) 
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Some  concepu  (C  A)  ire  ittached  to  the  concept  A and  are  called  iu  indtxod 
aspotts.  For  example.  (AGE  JOHN)  may  be  attached  to  JOHN  and  encodes  JOHlTs 
AGE  aspect 

Note  that  both  of  the  above  forms  of  attachment  are  easily  rccognixable  because  the 
concept  to  which  attachn>ent  is  made  appears  as  the  generalizer  or  specializer,  respectively, 
of  the  attached  concept  They  derive  from  the  essential  structure  of  concepu  and  serve 
much  the  same  purpose  for  the  conceptual  memory  as  do  index  entries  in  a book’s  index. 
These  attachmenU  do  not  really  bear  information;  they  are  established  when  the  toxonomy 
Is  built  and  are  not  subject  to  interpreution  or  change  Thus,  the  use  of  attachment,  a 
meta-level  operation,  is  appropriate 

Values  may  be  specified  by  attachment  e.g.,  [(AGE  JOHN)  49]. 

Attributes  or  descriptors  may  also  be  attached:  e.gn  [JOHN  HIOOLE-AGEOl 
[(AGE  JOHN)  (EQUAL  (TO  (AGE  NARY)))] 

Characterizations  may  also  be  specified  by  attachment  e.g.,  [JOHN  PROFESSOR]. 

This  second  set  exemplifies  storage  of  information  (facu  in  the  object  domain),  yet  we  are 
representing  such  Information  at  the  meta  level.  This  is  because  we  intend  that  reasoning 
be  based  on  these  facu  without  further  verification.  We  are  willing  to  guarantee  their 
truth  in  this  dontaln  of  application. 

HI.  How  Engllah  Ptiraf  • uru  ReprefiHud  n Conctptt 

In  this  section,  we  shall  first  argue  Informally  that  the  combination  of  concepu 
through  specialization  provides  a mechanism  capable  of  representing  English  phrases.  We 
will  then  extend  our  notion  of  specialization  to  deal  more  rigorously  with  some  problenu  we 


encounter. 
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► 
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IH.A  What  Poet  w Enalith  Exprettlon  Say? 


We  view  Engllih  phraies  ii  expreuioni  built  up  bf  combinttlon.  To  explore 
what  forms  of  combination  are  necesury,  we  examine  some  modts  of  eommurUeatUm  in 
English  and  see  how  they  are  achieved  by  combining  words  and  phrases. 

1}  DMignnting.  We  use  a conventional  name  for  a concept  which  the  listener 
may  be  assumed  to  know.  In  its  simplest  form,  the  conventional  name  is  a word  of  English, 
e.g.,  "apple,*  which  we  represent  in  OWL  by  APPLE  ■ (FRUIT  "APPLE").*  But  we  need  many 
more  conventional  names  than  we  have  words  in  our  language.  So,  we  permit  the 
formation  of  conventional  names  as  combinations  (pairs).  One  member  of  the  pair 
indicates  the  class  of  the  concept,  the  other  provides  a distinguishing,  or  specializing, 
element  to  make  the  pair  unique  For  example,  "apple  tree"  is  a conventional  name  formed 
by  specialiutlon.  In  LMS,  we  represent  it  u (TREE  APPLE).  Note  that  no  strong  distinction 
Is  made  between  conventional  names  that  are  compound  words  and  those  that  are  phrases 
in  English.  Compare  "fire  hydrant,"  (HYDRANT  FIRE),  and  "fireman*,  (HAN  FIRE). 

2)  Identifying.  We  identify  an  unnamed  concept  by  combining  its  clau  and 
some  (restricting)  nwdifiers.  For  example,  "tall  tree,*  (TREE  TALL),  and  "the  apple  tree  in  my 
yard*,  (((TREE  APPLE)  THE)  (IN  (YARD  NY))).'  The  difference  between  identifying  and 

* "APPLE"  ic  tii«  LMS  iMUtiott  f«r  tli«  trinbol  "apple*.  The  concept  (FRUIT  "APPLE")  ie  LMS’a 
notation  for  the  Enfliah  concept  apple. 

^ Sonw  linfoiata  might  feel  that  this  phrase  shoeM  hare  a different  atmctnre,  each  aa 
(((TREE  APPLE)  (IN  (YARD  HY)))  THE).  We  do  not  claim  to  hare  tho  final  answer  to  all  sndi 
atmctnral  gnostions,  het  oer  formalism  allows  ns  to  capitalise  on  whatever  inaiglits  Imgaiats  map 
hava.  Where  stmctaree  are  in  dispela,  we  have  chosen  what  seems  best  to  ns. 
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detlgnatlng  Is  often  slight.  In  designatlngi  we  auumc  that  the  hearer  already  knows  the 
concept,  whereas  in  Identify ing^^erksk  him  to  come  to  know  it  from  what  he  knows  of  itt 
component*  and  whatever  else  we  may  later  tell  him.  Thus,  a "shoe  tree,"  which  we  might 
initially  accept  as  art  identifying  compound  without  a conventional  designation,  may  come 
to  designate  a concept  if  shoe  trees  become  a popular  consumer  Item.  Just  as  compound 
words  develop  from  conventional  names  that  are  phrases,  the  latter  develop  from 
IJeiitifyiP"  phfiiU'S. 

3)  GpedfytnA  t grammatical  or  intarprativa  aapact.  Chiefly  by  word 
affixes,  Eng;iSh  I'larks  pmases  and  gives  clues  to  their  use  in  forming  sentences  and  to 
their  proper  inter  r.taJ.jn.  For  example,  frr  "books,"  (BOOK  -S),  the  -S  U a grammatical 
ir  arking  '5f  pk/fii  on  tie  bay?  concept  BOOK.  In  "hitting,"  (MIT  -IMG),  and  "to  Jump,"  ( JIW 
TO),  the  -I  U I u > j play  i similar  rote.  This  form  of  marking  is  called  irtfltctlon.  In  LMS, 
?nfl  is  expv  i v by  specialitlng  the  concept  to  be  inflected  by  the  affix  (or  other 
n/arkc't> 


% a j.”iu»,ilc  aspect.  We  also  represent  scnrtantlc  aspects  by 

'per  a»r;  m.  jrnar'pi  . ''’zt  7'' apple,"  (SIZE  APPLE). 


1 . . K r nt  to  say  rmething  about  an  object  or  action  in  a 

I u'e  u'"  predication.  Jeiperiin  [19331  calls  this  nexiu: 

F w cn..i  . I ’ll'  p’l/  di  '1  and  tl,t  barking  dog,  on  the  one  hand  (janction)  and  on 
l,  • ( >T  <.’  « L lid  -iiKj  Ihe  it  g hatU  or  Ike  dog  U barking  (nexntX  we  find  that 
.hi  ,V , ,P(‘  kiiir  \t  more  ^ or  bC'ii,  a,.d  the  .alter  nioro  pliahle;  There  ie,  aa  it  were, 

mor'3  life  in  it  n jMiictiu  >»  li^e  a picture,  a nexui  it  like  a drama  or  proceea.  In  a 
if  I..V  ..lilted  to  tis  co;iceptii Jt  contained  in  the  primary:  the  difference 
o ii  * I ..  t jror'ior  .1  e'en  ol'!arIy  bj  comparing,  r.g. 
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Tli«  blae  dreac  m Ui«  oldwU 

The  oMeet  drew  it  biee. 

A dmcinc  woman  charms. 

A cbarminf  woman  dancea. 

In  our  terms,  a Junction  Identifies  or  designates.  A nexus,  or  predication,  makes  a statement 
and  depends  on  interpretation  for  its  meaning. 

In  LMS,  we  Introduce  a new  notation  to  expreu  predication:  subjta  / prtdUott. 
For  example;  Jespersen’s  sentence  "the  oldest  dreu  is  blue”  becomes 
((DRESS  OLDEST)  THE)  / BLUE.  For  uniformity  of  representation  and  implemenutional 
oinvenience,  however,  we  will  implement  predication  in  LMS  using  specialization  by 
adopting  the  following  convention:  The  predication  A / B will  be  Implemented  as 
((B  NEXUS.)  A). 

6)  Hwnizition.  To  specify  a group  of  things  related  in  some  simple  way,  we 
itemize  thcn^  Particular  types  of  itemlution  are  sequences,  conjunctions,  disjunctions, 
sums,  products,  contrasting  pairs,  etc  For  example  "red,  white  and  blue,"  "S»5«9,"  and 
"Input/output"  are  all  itemizations.  LMS  introduces  an  external  noution  for  such 
itemizations  but  implements  them  by  a conventional  use  of  specialization  and  attachment 
The  detaib  are  unimportant  and  will  not  be  pursued  here  We  should  add.  however,  that 
we  fed  the  notion  of  sequence  to  be  fundamental 

7)  Nulling.  This  important  mechanism  of  English  will  play  a major  role  in  our 
representation  formalisra  Language  often  uses  context  to  say  concisely  what  might 
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otherwise  require  a verbose  specification.  In  particular,  we  often  use  part  of  a compound  to 
name  the  whole  6CNERAL  for  (OPPICER  GENERAL).  CAPITAL  for  (LETTER  CAPITAL),  and  EMPTY 
for  (CONTAINER  EMPTY).  In  each  of  these  cases,  the  specializer  in  context  names  the  whole 
concept  We  shall  encounter  more  general  uses  of  naming  below. 

I 

IHB  Kinds  of  Specialization 

Our  treatment  of  specialiutlon  as  outlined  above  is  inadequate  for  some  subtler 
issues  of  representation.  Although  we  have  identified  several  uses  of  compound  formation 
In  English  communication,  we  have  represented  them  aH  by  the  same  specialization 
operation.  We  form,  in  a completely  similar  manner,  compound  phrases  Hke  the  dog.”  (DOG 
THE),  "sheep  dog,*  (DOG  SHEEP),  "small  dog,"  (DOG  SHALL),  and  "dog  In  the  yard.”  j 

(DOG  (IN  (YARD  THE))).  For  these  examples,  no  problems  arise  because  we  can  recapture  | 

from  the  specializer  Itself  what  kind  of  compound  we  have  formed.  But  that  will  not  | 

generally  be  the  case,  as  we  shall  see  below.  In  this  section,  we  Introduce  seven  distinct  ■! 

I 

kinds  of  specialization  to  enrich  our  representation  scheme.  j 

*3 

The  English  phrase  "fat  man*  it  ambiguouv  In  its  common  meaning,  it  stands  i 

for  a man  who  is  overweight  to  some  degree.  The  same  phrase,  however,  also  describes  a | 

professional  circus  performer  of  great  girth,  with  whom  we  associate  characteristic  forms  of 
dress,  behavior,  etc.  In  terms  of  the  modes  of  communication  listed  above,  we  are  either  j 

(1)  designating  the  circus  performer  by  his  conventional  name  or  (2)  identifying  the  man  j 
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who  Is  overweight  by  hit  genus  and  a distinguishing  characteristic*  OWL  is  unique  in 
that  we  make  a proetdural  distinction  between  these  two  senses  of  ”fat  man.”  In  the  first 
case,  ”fat”  is  combined  with  ”man”  to  identify  a pattern  in  memory,  and  then  that  pattern  is 
used  to  find  the  referent  In  the  second  case,  ”man”  alone  is  used  to  find  a pattern  in 
memory,  and  then  items  which  match  this  pattern  are  further  checked  to  tee  if  they  pau 
the  pattern  designated  by  "fat”  We  could  imagine  a skinny  fat  man  only  in  the  first  sense, 
as  referring  to  the  drcut  performer.  But  our  representational  scheme,  as  presented  to  far, 
offers  only  (HMl  FAT)  for  "fat  man,”  and  fails  to  distinguish  the  two  senses  we  have 
discussed. 


To  preserve  the  desired  distinction  between  these  readings  of  "fat  man,"  we  will 
mark  every  spedaHzation  with  its  mta-tjpt,  which  indicates  the  relation  between  the 
concept  and  its  genus.*  We  will  represent  our  overweight  man  by  a restrictive 
spedaHiatlon.  (HAfhR  FAT).  A rtstrUUen  (A*R  6)  may  always  be  paraphrased  as  "an  A 
which  is  A,”  e.g.,  "a  man  who  is  fat,"  and  a restriction  always  represents  a concept  which  is  a 
kind  of  its  genus  with  the  additional  attribute  which  is  its  speciallier.  Note  that  a Ull  fat 
man,  ((NAIhR  FAT)«R  TALL)  is  not  the  same  u a fat  tall  man.  ((NAN*R  TALL)*R  FAT),  either 
toi  real  Hfe  or  in  conceptual  memory.  In  a sttrtotjp*,  (A*T  A),  the  spedalizer  has  sonte  close 


* In  sfokM  lMf«af«i  tli«  oomeoNiid  repreMntinf  Uie  conventional  nanw  it  tpokon  almoot  at  if  it 
warn  the  oomaoNiid  word  "fatman."  Thit  additional  oloo  it  not  available  to  vt  via  written 

lanfaafa 

* We  are  introdneing  a miner  inconaittancy  here,  beeaeta  we  change  the  meawiNg  ef  "geMe* 
lomtwbat  By  the  miet  of  LMS,  the  gannt  of  tba  concept  (AeR  A)  it  A*R,  yet  we  trill  refer  bare  !• 
A,  the  oancapt’t  UnguUtte  gtnus,  at  iu  genet. 
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relation  to  the  genus  but  Is  not  necessarily  a property  of  It  Consider  not  just  our  circus 
performer,  (mN*T  FAT),  but  also  (HY0RANT*T  FIRE),  where  the  relation  between  "fire*  and 
"hydrant"  Is  a complex  one  "a  hydrant  which  Is  a source  of  water  with  which  one  can  put 


The  seven  OWL  meta-types  and  their  notatlonal  suffixes  are 


«R  restriction 
•T  stereotype 
•S  species 
*I  instance 


•A  aspect 
•X  Inflection 
•P  partitive 


(A*S  B)  represents  a iubsptdos  of  A,  where  B Is  often  Just  a symbol.  This 
represenu  a LInnaean  classification  system  in  which  we  assume  that  different  subspecies  of 
A form  mutually  exclusive  categories.  This  is  a powerful  tool  for  daabase  search.  (A*I  B) 
represents  an  tnstanc*  of  A.  Instances,  as  species,  are  mutually  exclusive.**  We  thus 
provide  a distinction  between  classes  and  individuab  by  distingubhing  Instances  from 
spedes. 


An  aspect  specialization  (C*A  B)  Is  a kind  of  Its  genus  C,  which  is  closely 
associated  with  its  speciallzer  B.  For  example,  "height  of  John,"  (HEI6HT«A  JOHN)  and 
"John's  leg,”  (LEG«A  JOHN).  Aspecb  abo  play  the  traditional  role  of  programming  language 
variables.  For  example.  If  we  have  a recipe  for  pancakes  which  calb  for  one  egg.  that  egg 
will  be  represented  by  (EGG*A  (RECIPE*T  PANCAKE)) 


**  Some  syttemt  farther  divide  inetencee  into  HiMifestalioM:  "the  yoang  Chardiill."  We 

wrald  handle  this  os  (CHURCHILL*R  YOUNG),  where  CHURCHILL  ■ (NAN*I  "CHURCHILL"). 
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An  infltcttm,  (A»X  B),  Is  used  to  specify  a grammatJcal  or  Interpretive  aspect.  It 
has  the  unusual  behavior  that  It  Inherlu  properties  not  only  from  lu  genus,  as  all  other 
specialization  types  do.  but  also  from  Its  speclallzer.  In  fact,  properties  Inherited  from  the 
speclalizer  override  any  Inherited  from  the  genus.  For  example,  "books,"  (B00k*X  -S),  Is 
plural  even  though  BOOK  U singular,  because  -S  carries  the  plural  property. 

The  partitive,  (A*P  B),  is  Hke  a semantic  version  of  Inflection.  The  partitive 
Inherits  properties  from  both  lu  genus  and  speclalizer,  where  context  determines  the 
appropriate  Interpretation.  Thus,  one  may  firu  open  and  then  eat  a can  of  beans,  first 
opening  the  can  and  then  eating  the  beans. 

The  above  Is  a short  sketch  of  our  approach  to  representation.  A much  more 
complete  treatment  will  be  found  In  [Martin  1977]. 

IH.C  Parting 

To  transiale  from  strings  of  English  words  to  their  representation,  we  use  an 
augmented  transition  network  parser  based  on  [Woods  19701  The  OWL  parser  uses  no 
registers  but  maintains  a consUtuent  stack  of  concepa  with  each  phrase  for  which  a 
transition  network  (TN)  U being  followed.  On  every  arc  is  an  OWL  concept  which  must 
be  matched  for  that  transition  to  apply  and  a set  of  combining  functions  which  manipulate 
the  matching  concept  and  constituent  stack. 


It  Is  the  task  of  the  combining  functions  to  compose  OWL  concepu  representing 
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parts  of  « phrase  Into  the  concept  representing  the  whole  phrase.  The  role  of  the  TN  U to 
invoke  the  combining  functions  in  the  appropriate  sequence.  The  parser  operates 
nondeterminlstically  (via  backtracking).  Failure  leading  to  backup  may  occur  either  because 
the  input  string  fails  to  meet  word-order  constraints  (i.e.,  no  match  can  be  found  for  any  arc 
from  a non-terminal  node  of  a TN)  or  because  a combining  function  rejects  a proposed 
phrase.  The  conceptual  memory  contains  (expressed  via  attachment)  strictly  enforced 
constraints  on  case  slots  of  all  grammatical  concepts.  Using  these  constraints,  the  combining 
functions  control  all  compositions  such  as  adjectival  and  adverbial  modification  and  case 
assignment  for  verb  phrases.  The  word-order  constraints  of  the  TN*s  plus  the  ooncqX' 
formation  constraints  in  the  conceptual  memory  (as  they  are  used  by  the  combining 
functions)  thus  express  our  grammar. 

Two  mechanisms  of  special  interest  should  be  mentioned:  the  use  of  naming  to 
postpone  the  introduction  of  ambiguity,  and  bidding.  Because  many  English  words  and 
phrases  have  alternate  interpreutions  in  LMS  (e.g.,  our  "fat  man*  example),  if  we  were  to 
split  our  computation  nondetermintstically  every  time  alternative  interpretations  of  a phrase 
were  available,  we  would  spend  a lot  of  processing  effort  carrying  alt  those  interpretations 
along  until  all  but  one  could  be  eliminated.  Further,  if  more  than  one  interpretation 
succeeded  and  the  sentence  parsed  ambiguously,  we  would  have  a difficult  task  localizing 
the  cause  of  the  ambiguity.  To  avoid  these  problems,  we  take  a "wait  and  see"  approach^* 
and  try  not  to  choose  the  appropriate  interpretation  until  some  further  constraint  forces  that 


This  t«cliiii<|ac  ii  moiivaied  by  [Vslts  197t]  and  alto  appUad  in  a parser  by  Marens  [191S1 
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choice.  Postponing  the  choice  is  accomplished  by  use  of  the  naming  mechanism  introduced 
above  In  our  "fat  man"  example,  we  say  that  conventionally  we  will  form  the  restriction 
(MXNaR  FAT)  as  the  interpreution  of  the  phrase  and  we  will  have  in  the  knowledge  base  an 
indication  that  (HAN«R  FAT)  names  (NAN*T  FAT).  In  this  case,  the  distinction  may  never  have 
to  be  drawn  during  parsing,  since  no  grammatical  decisions  will  depend  on  it,  and  it  will  be 
some  later  step  of  reasoning  in  the  system  that  may  have  to  choose  the  "circus  perfornr>er" 
interpretation. 

In  a typical  situation  where  grammatical  distinctions  arise  early  in  parsing,  we 
take  a slightly  different  approach  from  the  previous  example.  The  word  "drinks”  is  either 
the  plural  of  the  noun  "drink,"  as  in  "We  had  a few  drinks,"  or  the  third  person  singular  of 
the  verb  "drink,"  as  in  "Joe  drinks  beer  at  dinnertime"  Here,  rather  than  choosing  one  of 
these  as  a primary  interpretation,  we  create  the  neutral  (DRINK*X  -S)  and  say  that  it  names 
both  (DR1NK«X  PLURAL-NOUN)  and  (0R1NK*X  THIRD-PERSON-SINGULAR-VERB).  To  make  this 
scheme  work,  every  combining  function  must  succeed  not  only  when  the  concepts  given  to  it 
may  be  directly  combined  but  also  when  any  concepts  named  by  the  given  ones  may  be 
combined.  Matching  of  concepts  on  TN  aru  is  similarly  augmented.  Further,  rules  like  the 
above  for  "drink"  generalize,  and  OWL  encodes  those  generaliutions  rather  than  specific 
naming  ru)es  for  each  concept^* 

Bidding  is  another  mechanism  for  deferring  a choice  among  alternatives  and 


Tk«M  naming  genaraliutions  are  called  productive  naming  nilea.  They  are  applied  by  the 
normal  inheritance  mechaniem  of  LMS,  lo  of  coerre  they  may  be  overridden  by  more  specific 
information  in  any  particniar  case. 
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avoiding  undua  nondetermlniim.  Iti  application  ii  bmt  Mvn  when  conildering  the 
atachment  of  prepositional  phrases.  For  example,  in  *I  rode  along  the  highway  in  my 
limousine,”  we  may  eliminate  'the  highway  in  my  limousine”  as  implausible  and  attach  the 
prepositional  phrase  to  the  predicate  (or  predication).  By  contrast,  in  ”I  liked  the  phone  in 
my  linwusine,”  the  prepositional  phrase  clearly  belongs  with  'phone”  We  cannot  always 
ntake  such  a deftnltlve  Judgment:  1 saw  the  man  beside  our  house”  places  either  me  or  the 
man  beside  tne  house.  From  further  context,  the  ambiguity  may  be  resolved:  ”As  I 
approached,  I saw  the  man  beside  our  house.”  We  treat  this  problem  by  suspending  a path 
in  parsing  at  a point  where  it  is  about  to  take  an  arc  transition  for  a prepositional  phrase 
until  all  possible  paths  leading  to  taking  such  a transition  for  that  same  phrase  are 
Identified.  Then,  a conflict-resolving  routine  Is  called  to  permit  any  number  of  the  possible 
interpreations  to  proceed.  That  routine  will,  in  general.  Invoke  the  Interpreter  to  try  to 
decide  which  interpretation(s)  are  best.  Its  success  will  depend  on  the  sophistication  of 
world  knowledge  in  the  conceptual  memory  and  on  the  existence  of  appropriate  strategies 
available  to  the  Interpreter  to  apply  that  knowledge.  A more  specific  mechanism  which 
similarly  addresses  the  problem  of  'selective  modifier  placement”  is  presented  in  [Woods 
I97S1  We  have  not  yet  made  any  significant  use  of  this  bidding  strategy- 

IV.  Reasoning 

We  nave  implemented  an  initial  version  of  an  lnt«rpr$trr  for  OWL,  which  is  the 
basis  of  the  system’s  ability  to  reason.  It  is  a large  program  with  many  interesting 
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capabilities,  of  which  we  will  here  describe  only  the  central  ones.  Sunguroff  [1976]  describes 
the  Implementation  details  of  the  current  version,  Brown  [1977]  Is  concerned  with  use  of  the 
Interpreter  for  dialog  and  the  handling  of  failure.  Long  D977]  gives  another  view  of  the 
Interpreter’s  use  for  automatic  programming,  and  Swartout  [1977]  discusses  the  Interpreter’s 
record-keeping  and  updating  capabilities  and  their  relation  to  explaining  program 
behavior. 


So  far,  we  have  Interpreted  OWL  concepts  as  static  entities,  mere  translations  of 
English  phrases.  The  system’s  action  when  given  the  sentence  Trescribe  an  appropriate 
dosage  of  Digitalis  for  Mr.  Jones”  cannot  be  merely  to  translate  that  sentence  into  Its 
internal  representation  and  then  stop.  But  how  is  It  to  know  what  the  procedural  nteanlng 
of  some  senteiKe  is? 


If  an  OWuf.  concept  has  a METHOD  aspect,  then  it  is  called  a PLAN  and  is  something 
which  the  Interpreter  can  carry  out.  When  the  Interpreter  is  cai/od  (its  argument  is  the  caiO, 
ft  performs  the  following  steps; 

1)  It  tries  to  match  the  call  to  a known  plan  in  the  knowledge  base.  The  search 

for  a nuitching  plan  proceeds  "upward"  from  the  call,  so  that  the  most 
specific  plan  which  matches  wilt  be  selected.^’ 

2)  It  checks  that  any  required  properties  on  the  cases  (variables)  of  the  plan  occur 

also  on  the  concepts  which  will  be  matched  to  them. 


This  is  a very  important  idea.  With  it,  we  can  embed  completely  specific  pisns  to  soire 
any  problems  which  we  know  will  arise  often  end  will  be  critical  to  the  system's  performance. 
We  also  nse  it  to  eipreu  plans  when  their  choice  is  dictated  not  by  a reasoned  choice  but  by 
convention  in  the  application  area.  If  a specific  plan  is  unavailable,  slightly  more  general 
plans  will  be  attempted,  and  only  if  all  such  plans  are  found  inapplicable  will  the  system 
resort  to  some  general  deductive  scheme.  We  have  noted  that  only  when  a great  majority  of 
specific  plans  for  a domain  is  available  will  the  system's  performance  be  at  an  "eipert"  levcL 
This  agrees  with  our  observations  that  human  experts  seem  to  have  large  portions  of  their 
ordinary  professional  behavior  "precompiled*  into  fixed  routines. 
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S)  It  crettes  a new  event,  to  record  the  initiation  of  execution  of  the  selected  plan, 
and  binds  all  the  matched  variables. 

4)  If  the  plan  contains  a PREREQUISITE  aspect,  it  checks  if  it  is  already  TRUE  and  if 

not,  then  it  tries  to  make  it  true.  This  subgoal  step  of  course  once  again  uses 
the  Interpreter. 

5)  It  carries  out  the  steps  of  the  METHOD,  either  in  parallel  or  in  sequence, 

whichever  is  specified. 


We  attempt  always  to  use  the  Interpreter  to  solve  subproblems  of  an  initial 
problem  so  that  the  general  matching  and  reasoning  resources  we  build  up  will  be  available 
at  all  levels.  For  example,  if  X is  a prerequisite  which  is  not  yet  satisfied,  we  merely  call  the 
Interpreter  with  the  call  (GET«T  X).  Classical  goal-directed  behavior  can  be  achieved  by  use 
of  the  PRINCIPAL-RESULT  case  on  a plan,  which  identifies  the  teleological  goal  of  the.  plan. 
Then,  if  a GET  is  unable  to  find  a plan  by  its  upward  search  of  the  concept  tree,  it  may 
search  for  a matching  principal  result  and  select  the  plan  which  promises  that  result  One 
other  important  aspect  of  the  Interpreter  is  that  after  every  step  of  interpreutlon,  it 
dispatches  to  its  next  step  through  the  main  top-level  loop.  There,  failure-handling  and 
advice-giving  procedures  may  always  be  invoked  to  redirect  the  course  of  computation  by 
"backing  off"  from  unproductive  lines  (if  they  can  be  recognized). 


We  are  continuing  to  refine  our  undersunding  of  the  representation  erf  English 
phrases  in  the  formal  notation  of  OWL  and  the  use  of  a complex  Interpreter  which  works 
within  that  formalism  to  perform  all  reasoning  tasks  which  arise  in  language  processing 
and  various  application  areas. 
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